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® In a superscalar processor system which is ca- 
pable of accessing multiple instructions simulta- 
neously, a branch history table (60) is established 
which includes a predictive field (62-76) for each 
possible instruction fetch position within a multi- 
instruction access. Each group of predictive fields 
(62-76) is accessed within the branch history table 
(60) utilizing a portion of the instruction fetch ad- 
dress (56,58). A particular predictive field (62-76) 
within the group is then selected which corresponds 
to the position of the branch instruction within the 
instruction fetch. The content of the selected predic- 
tive field (62-76) is then utilized to predict whether or 
not a branch is taken for the corresponding branch 
instruction. Each predictive field (62-76) preferably 
comprises a two bit binary counter which is incre- 
mented or decremented in response to a branch 
being "taken" or "not taken". 
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The present invention relates in general to an 
improved data processing system and in particular 
to a method and system for improved branch his- 
tory prediction in a data processing system. Still 
more particularly, the present Invention relates to 
an improved method and system for enhanced 
branch history prediction in a superscalar proces- 
sor system which is capable of simultaneously 
dispatching a plurality of instructions. 

Designers of modern state-of-the-art data pro- 
cessing systems are continually attempting to en- 
hance the performance aspects of such systems. 
One technique for enhancing data processing sys- 
tem efficiency is the achievement of short cycle 
times and a low Cycles-Per-lnstruction (CPI) ratio 
An example of the application of these techniques 
to an enhanced data processing system is the 
International Business Machines Corporation RISC 
System 6000 (RS/6000) computer. The RS/6000 
system is designed to perform well in numerically 
intensive engineering and scientific applications as 
well as in multi-user, commercial environments. 
The RS/6000 processor employs a superscalar im- 
plementation, which means that multiple instruc- 
tions are issued and executed simultaneously. 

The simultaneous issuance and execution of 
multiple instructions requires independent function- 
al units that can execute concurrently with a high 
instruction bandwidth. The RS/6000 systems 
achieves this by utilizing separate branch, fixed 
point and floating point processing units which are 
pipelined in nature. In such systems a significant 
pipelined delay penalty may result from the execu- 
tion of a so-called "conditional branch" instruction. 
Conditional branch instructions are instructions 
which dictate the taking of a specified conditional 
branch within an application in response to a se- 
lected outcome of the processing of one or more 
other instructions. Thus, by the time a conditional 
branch instruction propagates through a pipeline 
queue to an execution position within the queue, it 
will have been necessary to load instructions into 
the queue behind the conditional branch Instruction 
prior to resolving the conditional branch, in order to 
avoid run-time delays. 

One attempt at minimizing this run-time delay 
in pipelined processor systems involves the provi- 
sion of an alternate instruction queue. Upon the 
detection of a conditional branch instruction within 
the primary instruction queue, the sequential 
instructions following the conditional branch instruc- 
tion within the queue are immediately purged and 
loaded into the alternate instruction queue. Target 
instructions for a predicted conditional branch are 
then fetched and loaded into the primary instruc- 
tion queue. If the predicted conditional branch does 
not occur, the sequential instructions are fetched 
from the alternate instruction queue and loaded into 



the primary instruction queue. While this technique 
minimizes run-time delay, it requires the provision 
of an alternate instruction queue and a concomitant 
increase in the hardware assets required. 

6 Another attempt at minimizing run-time delay in 

pipelined processor systems Involves the utilization 
of a compiler to insert large numbers of instruc- 
tions into the queue between a conditional branch 
instruction and the instruction which generates the 

10 outcome which initiates the conditional branch. This 
technique attempts to resolve the conditional 
branch and place the appropriate target instructions 
or sequential instructions into the instruction queue 
prior to execution of the conditional branch instruc- 
ts tion during the delay between execution of the 
instruction which generates the outcome which ini- 
tiates the conditional branch and the execution of 
the conditional branch instruction. In theory, this 
technique will minimize run-time delay without re- 

20 quiring the provision of an alternate instruction 
queue; however, it is often difficult to insert suffi- 
cient numbers of instructions into the queue to 
accomplish the necessary delay. 

As a consequence, the efficiency of a data 

25 processing system may be enhanced by accu- 
rately predicting whether or not the application will 
branch to a specified conditional branch in re- 
sponse to encountering a branch instruction. One 
technique for predicting whether or not a particular 

30 branch will be "taken" or "not taken" is the utiliza- 
tion of so-called "Branch History Tables" (BHT). A 
Branch History Table is utilized to store the recent 
history of a particular branch instruction In order to 
accurately predict whether or not the execution of 

35 that instruction will result in a branch within an 
executing set of Instructions. While the provision of 
branch history tables provides a relatively straight- 
forward technique for minimizing run-time delay, 
problems exist in superscalar processor systems 

40 wherein multiple instructions are fetched within a 
single access. Depehding upon the pipeline in such 
a system the branch address may not be known 
until the target address calculation. If the branch 
address Is not known until the target address cal- 

45 culation and the branch prediction is to be done in 
parallel with the address calculation then branch 
history table access becomes quite critical. 

It should therefore be apparent that a need 
exists for an improved method and system for 

50 predicting the outcome of a branch instruction uti- 
lizing a branch history table. 

The above drawbacks of the prior art are over- 
come by the invention as claimed. 

It is one object of the present invention to 

55 provide an improved data processing system. 

It is another object of the present invention to 
provide a method and system for improved branch 
history prediction in a data processing system. 
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It is yet another object of the present invention 
to provide an Improved nnethod and system for 
enhanced branch history prediction in a super- 
scalar data processing system which is capable of 
simultaneously dispatching a plurality of instruc- 
tions. 

The foregoing objects are achieved as is now 
described. The method and system of the present 
invention may be utilized to enhance branch history 
prediction accuracy In a superscalar processor sys- 
tem by maintaining branch history tables which 
include a separate branch history for each instruc- 
tion fetch position within a multi-instruction access. 
In a superscalar processor system which is ca- 
pable of accessing multiple instructions simulta- 
neously, a branch history table is established which 
includes a predictive field for each possible instruc- 
tion fetch position within a multi-instruction access. 
Each group of predictive fields is accessed within 
the branch history table utilizing a portion of the 
instruction fetch address, such as the low order 
address bits. A particular predictive field within the 
group is then selected which corresponds to the 
position of the branch instruction within the instruc- 
tion fetch. The content of the selected predictive 
field is then utilized to predict whether or not a 
branch is taken for the corresponding branch in- 
struction. Each predictive field preferably com- 
prises a two bit binary counter which is incre- 
mented or decremented in response to a branch 
being "taken" or "not taken" and then stored again 
within the branch history table. 

The above as well as additional objectives, 
features, and advantages of the present invention 
will become apparent in the following detailed writ- 
ten description. 

The novel features believed characteristic of 
the invention are set forth in the appended claims. 
The invention itself, however, as well as a preferred 
mode of use, further objectives and advantages 
thereof, will best be understood by reference to the 
following detailed description of an illustrative em- 
bodiment when read in conjunction with the accom- 
panying drawings, wherein: 

Figure 1 is a high level block diagram of a 
superscalar data processing system which may 
be utilized to implement the method and system 
of the present invention; 

Figures 2a and 2b are schematic representa- 
tions of instruction queues which depict two 
examples of a simultaneous fetch of multiple 
instructions in accordance with the method and 
system of the present invention; 
Figure 3 is a schematic representation of a 
branch history table wherein each entry includes 
multiple predictive fields, in accordance with the 
method and system of the present invention; 



Figure 4 is a table depicting the content of each 
possible predictive field within the branch his- 
tory table of Figure 3 and an associated predic- 
tion; and 

5 Figure 5 is a logic flowchart illustrating the pro- 
cess of branch prediction in accordance with the 
method and system of the present invention. 
With reference now to the figures and in par- 
ticular with reference to Figure 1 , there is depicted 
w a high level block diagram of a superscalar com- 
puter system 10 which may be utilized to imple- 
ment the method and system of the present inven- 
tion. As illustrated, computer system 10 preferably 
includes a memory 18 which is utilized to store 
75 data. Instructions and the like. Data or instructions 
stored within memory 18 are preferably accessed 
utilizing cache/memory interface 20 in a method 
well known to those having skill in the art. The 
sizing and utilization of cache memory systems is 
20 a well known subspecialty within the data process- 
ing art and not addressed within the present ap- 
plication. However, those skilled in the art will ap- 
preciate that by utilizing modern associative cache 
techniques a large percentage of memory access- 
25 es may be achieved utilizing data temporarily 
stored within cache/memory interface 20. 

Instructions from cache/memory interface 20 
are typically loaded into instruction queue 22 which 
preferably Includes a plurality of queue positions. 
30 In a typical embodiment of a superscalar computer 
system the instruction queue may include eight 
queue positions and thus, in a given cycle, be- 
tween zero and eight instructions may be loaded 
into instruction queue 22. depending upon how 
35 many valid instructions are passed by 
cache/memory interface 20 and how much space is 
available within instruction queue 22. 

As is typical in such superscalar computer 
systems, instruction queue 22 is utilized to dis- 
40 patch instructions to multiple execution units. As 
depicted within Figure 1, computer system 10 in- 
cludes a floating point processor unit 24. a fixed 
point processor unit 26, and a branch processor 
unit 28. Thus, instruction queue 22 may dispatch 
45 between zero and three instructions during a single 
cycle, one to each execution unit. 

In addition to sequential instructions dispatched 
from instruction queue 22, so-called "conditional 
branch instructions " may be loaded into instruc- 
50 tion queue 22 for execution by the branch proces- 
sor. A conditional branch instruction is an instruc- 
tion which specifies an associated conditional 
branch to be taken within the application in re- 
sponse to a selected outcome of processing one or 
65 more sequential instructions. In an effort to mini- 
mize run-time delay in a pipelined processor sys- 
tem, such as computer system 10. the presence of 
a conditional branch instruction within the instruc- 
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tion queue is detected and an outcome of the 
conditional brancli is predicted. As should be ap- 
parent to those having skill in the art when a 
conditional branch is predicted as "not taken" the 
sequential instructions within the Instruction queue 
simply continue along a current path and no 
instructions are altered. However, if the prediction 
as to the occurrence of the branch is incorrect, the 
instruction queue must be purged of sequential 
instructions, which follow the conditional branch 
instruction in program order and target instructions 
must be fetched. Alternately, if the conditional 
branch Is predicted as "taken" then the target 
instructions are fetched and utilized to follow the 
conditional branch, if the prediction is resolved as 
correct. And of course, if the prediction of "taken" 
is incorrect the target instructions must be purged 
and the sequential instructions which foltow the 
conditional branch instruction in program order 
must be retrieved. 

As illustrated, computer system 10 also prefer- 
ably includes a condition register 32. Condition 
register 32 is utilized to temporarily store the re- 
sults of various comparisons which may occur uti- 
lizing the outcome of sequential instructions which 
are processed within computer system 10. Thus, 
floating point processor unit 24, fixed point proces- 
sor unit 26 and branch processor unit 28 are all 
coupled to condition register 32. The status of a 
particular condition within condition register 32 may 
be detected and coupled to branch processor unit 
28 in order to generate target addresses, which are 
then utilized to fetch target instructions in response 
to the occurrence of a condition which initiates a 
branch. 

Thereafter, branch processor unit 28 couples 
target addresses to fetcher 30. Fetcher 30 cal- 
culates fetch addresses for the target instructions 
necessary to follow the conditional branch and cou- 
ples those fetch addresses to cache/memory inter- 
face 20. As should be appreciated by those having 
skill in the art, if the target instructions associated 
with those fetch addresses are present within 
cache/memory interface 20, those target instruc- 
tions are loaded into instruction queue 22. Alter- 
nately, the target instructions may be fetched from 
memory 18 and thereafter loaded into instruction 
queue 22 from cache/memory interface 20 after a 
delay required to fetch those target instructions. 

Referring now to Figures 2a and 2b. there are 
depicted schematic representations of two exam- 
ples of an instruction queue depicting a simulta- 
neous fetch of multiple instructions in accordance 
with the method and system of the present inven- 
tion. As illustrated within Figure 2a, an ordered 
sequence of instructions may be fetched from in- 
struction cache 40 utilizing a fetch address 38. 
These instructions are then loaded into instruction 



queue 42 as depicted. Thus, for a selected fetch 
address 30 instructions 1, 2, 3 and 4 may be 
loaded within an ordered sequence into instruction 
queue 42 at positions 44, 46. 48, and 50. 

6 Referring now to Figure 2b. it may be seen that 

a multi-instruction fetch which utilizes an alternate 
fetch address 38 may result in an ordered se- 
quence of instructions being loaded from instruc- 
tion cache 40 into instruction queue 42 such that 

10 instructions 3, 4. 5. and 6 are loaded into instruc- 
tion queue 42 at positions 44. 46, 48 and 50, as 
illustrated. Thus, it may be seen that while instruc- 
tions 3 and 4 are loaded into instruction queue 42 
at positions 48 and 50 within Figure 2a, the utiliza- 

75 tion of an alternate fetch address to access an 
ordered sequence of instructions may result in 
instructions 3 and 4 being loaded into instruction 
queue 42 at positions 44 and 46. 

The Applicant has discovered that variations in 

20 the relative position of an instruction within a multi- 
instruction access often have a substantial impact 
upon how a branch instruction at that position 
should be predicted. Thus, in accordance with the 
method and system of the present invention, a 

25 separate entry within a branch history table is pro- 
vided for each instruction. That entry will vary in 
response to variations in the relative position of the 
instruction, with respect to the other instructions in 
a multi-instruction access. 

30 This variation in branch history prediction is 

best illustrated with respect to Figure 3 which de- 
picts a schematic representation of a branch his- 
tory table wherein each entry includes multiple 
predictive fields, in accordance with the method 

35 and system of the present invention. As depicted, a 
branch history table 60 is illustrated. Branch history 
table 60 is preferably accessed utilizing a portion 
of the fetch address which is utilized to access a 
group of instructions from instruction cache 40 (see 

40 Figures 2a and 2b) such as. for example, the low 
order bits thereof. Fetch address bits 56, when 
utilized to access branch history table 60. will result 
in the accessing of multiple predictive fields. As 
illustrated, predictive fields A. B. C and D are thus 

45 accessed within branch history table 60 from posi- 
tions 62, 64. 66. and 68 respectively. Therefore, as 
illustrated, in a system which may simultaneously 
access and dispatch four instructions, each entry 
within branch history table 60 includes four sepa- 

60 rate predictive fields, each associated with an in- 
struction at a corresponding location within a group 
of four instructions which have been accessed. 

Thus, when utilizing an alternate fetch address, 
as depicted at reference numeral 58, an entirely 

65 different group of predictive fields are accessed. 
That is, predictive fields L. M, N, and O are acces- 
sed from within branch history table 60 at locations 
70, 72. 74, and 76. 
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In the manner described herein, each entry 
within branch history table 60 includes a group of 
predictive fields, each predictive field within the 
group corresponding to a particular instruction po- 
sition within a multi-instruction access, such as 
depicted within Figures 2a and 2b. In this manner, 
a separate branch history may be maintained for 
each instruction which is correlated to the position 
of the instruction within a multi-instruction access, 
since the relative position of an instruction within a 
mufti-instruction access often indicates an impor- 
tant function of that instruction's likely branch pre- 
diction. 

Referring now to Rgure 4, there is depicted a 
table which illustrates the content of each predic- 
tive field within branch history table 60 of Figure 3. 
In the depicted embodiment of the present inven? 
tion, each predictive field within a group of predic- 
tive fields within the branch history table of Figure 
3 may comprise a two bit binary counter which is 
selectively incremented or decremented in re- 
sponse to the execution of a conditional branch 
instruction resulting in the branch being "tal<en" or 
"not talcen," in a manner common in the prior art. 
As depicted within column 92, there are four possi- 
ble predictive field contents. If the predictive field 
contains the binary value "0 0" the prediction is a 
strong prediction that the branch is "not taken." If 
the predictive field content is a binary value of "0 
1 " this comprises a weak prediction that execution 
of the branch instruction wilt result in the branch 
being "not taken." Preceding through column 92. a 
binary content within the predictive field of "1 0" 
comprises a weak prediction that the branch in- 
struction will result in a branch being "taken" and 
finally, a binary value of "1 1 " comprises a strong 
prediction that execution of the branch instruction 
will result in the branch being "taken." 

Thus, in a manner which will be explained in 
greater detail below, by selectively incrementing or 
decrementing the content of an individual predic- 
tive field, the prediction of whether or not a branch 
associated with the branch instruction which cor- 
relates to that predictive field is "taken" or "not 
taken" may be selectively altered to reflect actual 
experience with that branch instruction. 

Rnally, with reference to Figure 5, there is 
depicted a logic flowchart which illustrates the pro- 
cess of branch prediction in accordance with the 
method and system of the present invention. As 
depicted, the process begins at block 100 and 
thereafter passes to block 102. Block 102 illustrates 
a determination of whether or not an instruction 
fetch has occurred. If not, the process merely it- 
erates until such time as an instruction fetch oc- 
curs. Next, still referring to block 102, if a multi- 
instruction fetch has occurred, the process passes 
to block 104. Block 104 illustrates the utilization of 



the low order fetch address bits to access branch 
history table 60 (see Figure 3). Thereafter, the 
process passes to block 106. Block 106 illustrates 
the reading out of a group of predictive fields from 
5 within branch history table 60. at a point within 
branch history table 60 which is indicated by the 
low order fetch address bits. 

Next, the process passes to block 108. Block 
108 illustrates the selection of a particular predic- 
10 tive field from within the group of predictive fields 
accessed utilizing the low order fetch address bits. 
As described above, the predictive field selected is 
that predictive field which corresponds to the 
branch instruction relative position within the mul- 
15 tiple instructions within the instruction queue. Thus, 
for example, if the branch instruction has been 
fetched "to the first position within the instruction 
queue, the first predictive field will be utilized. 
Similarly, if the branch instruction has been fetched 
20 to. the third position within the instruction queue, 
the third predictive field will be utilized. 

Next, the process passes to block 110 which 
illustrates the predicting of the branch. This is 
accomplished by assigning a prediction to whether 
25 or not execution of the branch instruction will result 
in a branch being "taken" or "not taken" by utiliz- 
ing the prediction associated with the predictive 
field content, as depicted within Figure 4. There- 
after, the process passes to block 112. Block 112 
30 illustrates a determination of whether or not execu- 
tion of the branch instruction has actually resulted 
in a branch being "taken." If so. the process 
passes to block 114. which illustrates the incre- 
menting of the predictive field value and the writing 
35 of a new value for the predictive field to the branch 
history table. Alternately, if execution of the branch 
instruction has not resulted in the branch being 
"taken" the process passes to block 116. Block 
116 illustrates the decrementing of the predictive 
40 field value, and the writing of the new value for the 
predictive field to the branch history table. There- 
after, or after incrementing the predictive field value 
the process jDasses to block 118 and returns, to 
await the next instruction fetch. Of course, incre- 
45 menting or decrementing a predictive field value 
will have no effect if the content of that field is 
already at a maximum or minimum value, respec- 
tively. 

Upon reference to the foregoing those skilled in 
50 the art will appreciate that by providing a separate 
predictive field associated with each relative in- 
struction position within a multi-instruction fetch, 
the accuracy of branch history prediction may be 
greatly enhanced in view of the discovery of the 
55 Applicant that the likelihood of a branch being 
"taken" is affected by those circumstances which 
alter the relative position of the branch instruction 
within a multi-instruction access. 
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Claims 

1. A method for enhanced branch history predic- 
tion accuracy in a superscalar processor sys- 
tem (10) which is capable of fetching and 
dispatching up to N instructions simultaneous- 
ly, said method comprising the steps of: 

establishing a branch history table (60) 
containing multiple predictive fields (62-76), 
each of said multiple predictive fields (62-76) 
containing data (92) indicative of a likelihood 
that execution of a particular associated in- 
struction will result in a branch within an ex- 
ecuting set of instructions; 

accessing (104) a selected point within 
said branch history table (60) utilizing at least a 
portion of a fetch address (56,58) which is 
utilized to access an ordered sequence of M 
instructions within said superscalar processor 
system (10); 

reading out (106) an ordered sequence of 
M predictive fields (62-76) from said selected 
point within said branch history table (60), each 
one of said ordered sequence of M predictive 
fields (62-76) being associated with a particular 
instruction at a corresponding position within 
said ordered sequence of M instructions; and 

utilizing (108,110) an associated one of 
said M predictive fields (62-76) to determine a 
likelihood that execution of a corresponding 
instruction within said ordered sequence of M 
instructions will result in a branch within said 
executing set of instructions, wherein variations 
in relative position of an instruction within said 
ordered sequence of M instructions will result 
in utilization of an alternate predictive field (62- 
76), thereby enhancing predicative accuracy. 

2. The method for enhanced branch history 
predicative accuracy in a superscalar proces- 
sor (10) according to Claim 1, wherein said 
step of establishing a branch history table (60) 
containing multiple predictive fields (62-76) 
comprises the step of establishing a branch 
history table (60) containing multiple predictive 
fields (62-76) wherein each predictive field (62- 
76) comprises a two bit binary counter (92). 

3. The method for enhanced branch history 
predicative accuracy in a superscalar proces- 
sor (10) according to Claim 2. further including 
the step of incrementing (114) said two bit 
binary counter (92) each time execution of said 
particular associated instruction results in a 
branch within said executing set of instructions. 

4. The method for enhanced branch history 
predicative accuracy in a superscalar proces- 



sor (10) according to Claim 2 or 3. further 
including the step of decrementing (116) said 
two bit binary counter (92) each time execution 
of said particular associated instruction does 
5 not result in a branch within said executing set 

of instructions. 

5. The method for enhanced branch history 
predicative accuracy in a superscalar proces- 

10 sor (10) according to any Claim from 1 to 4. 

wherein said step of accessing (104) a se- 
lected point within said branch history table 
(60) utilizing at least a portion of a fetch ad- 
dress (56.58) which is utilized to access an 

IS ordered sequence of M instructions within said 

superscalar processor system (10) comprises 
accessing (104) a selected point within said 
branch history table (60) utilizing low order 
address bits of a fetch address (56,58) which 

20 is utilized to access an ordered sequence of M 

instructions within said superscalar processor 
system (10). 

6. A system for enhanced branch history predic- 
25 tion accuracy in a superscalar processor sys- 
tem (10) which is capable of fetching and 
dispatching up to N instructions simultaneous- 
ly, said system comprising: 

means for establishing a branch history 

30 table (60) containing multiple predictive fields 

(62-76). each of said multiple predictive fields 
(62-76) containing data (92) indicative of a like- 
lihood that execution of a particular associated 
instruction will result in a branch within an 

35 executing set of instructions; 

means for accessing (104) a selected point 
within said branch history table (60) utilizing at 
least a portion of a fetch address (56,58) which 
is utilized to access an ordered sequence of M 

40 instructions within said superscalar processor 

system (10); 

means for reading out (106) an ordered 
sequence of M predictive fields (62-76) from 
said selected point within said branch history 

45 table (60), each one of said ordered sequence 

of M predictive fields (62-76) being associated 
with a particular instruction at a corresponding 
position within said ordered sequence of M 
instructions; and 

60 means for utilizing (108,110) an associated 

one of said M predictive fields (62-76) to deter- 
mine a likelihood that execution of a corre- 
sponding instruction within said ordered se- 
quence of M instructions will result in a branch 

55 within said executing set of instructions. 

wherein variations in relative position of an 
instruction within said ordered sequence of M 
instructions will result in utilization of an al- 
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ternate predictive field (62-76), thereby en- 
hancing predicative accuracy, 

7. The system for enhanced branch history predi- 
cative accuracy in a superscalar processor (10) 5 
according to Clainn 6, wherein said means for 
establishing a branch history table (60) con- 
taining multiple predictive fields (62-76) com- 
prises means for establishing a branch history 
table (60) containing multiple predictive fields io 
(62-76) wherein each predictive field (62-76) 
comprises a two bit binary counter (92). 

8. The system for enhanced branch history predi- 
cative accuracy in a superscalar processor (10) is 
according to Claim 7, further including means 

for Incrementing (114) said two bit binary 
counter (92) each time execution of said par- 
ticular associated instruction results in a 
branch within said executing set of instructions. 2q 

9. The method for enhanced branch history 
predicative accuracy in a superscalar proces- 
sor (10) according to Claim 7 or 8, further 
including means for decrementing (116) said 25 
two bit binary counter (92) each time execution 

of said particular associated instruction does 
not result in a branch within said executing set 
of instructions. 

30 

10. The system for enhanced branch history predi- 
cative accuracy in a superscalar processor (10) 
according to any Claim from 6 to 9, wherein 
said means for accessing (104) a selected 
point within said branch history table (60) utiliz- 35 
ing at least a portion of a fetch address (56.58) 
which is utilized to access an ordered se- 
quence of M instructions within said super- 
scalar processor system (10) comprises means 

for accessing (104) a selected point within said 4q 
branch history table (60) utilizing low order 
address bits of a fetch address (56,58) which 
is utilized to access an ordered sequence of M 
instructions within said superscalar processor 
system (10). 45 



50 



55 



7 



EP 0 605 876 A1 



10 



MEMORY 



->>-18 



30 



24 



FIXED POINT I INSTRUCTIONS 



FLOATING 
POINT UNIT 



FIXED POINT 
UNIT 



3 2^ 





I 



26 



CONDITION 
REGISTER 



28-v. 



20x 


CACHE/MEMORY 


FETCH 
ADDRESSES 


crcxri-ifio 




INTERFACE 










1 






A 


FLOATING POINT 
INSTRUCTIONS 


INSTRUCTION 


BRANCH 
INSTRUCTIONS 




QUEUE 






ft 


1 22-^. 











BRANCH 
UNIT 



8 



EP 0 605 876 A1 



FETCH 
ADDRESS 



L 



40 



3 8 



J 



42 



INSTRUCTION CACHE 










IMSTRUCTION 
1 


INSTRUCTION 
2 


INSTRUCTION 
3 


INSTRUCTION 
4 



44 



46 



INSTRUCTION QUEUE 



-4 8 



50 



ALTERNATE 
FETCH 
ADDRESS 

38 



INSTRUCTION CACHE 



^40 















INSTRUCTION 
3 


INSTRUCTION 
4 


INSTRUCTION 
5 


INSTRUCTION 
6 



INSTRUCTION QUEUE 5 0 



9 



EP 0 605 876 A1 



rtlCH 

ADDRESS 

BITS 



PRE0ICUVtPRCDlCTIYIPR£DICTIvePREOlCTIY£ 



56 



J 



ALHRHATt 
FEICH 
ADDRESS 
BITS 



58 



J 



neio A 



riCLO 8 



FIELD C 



^66 



nCLO D 



V68 



PREDICT iVEPRCOiCTiVEPREDICTIVEPREDICIIVC 



FIELD L 



^-70 



FIELD y 



FIELD N 



FIELD 0 



V7 6 



BRANCH HISI ORY TABLE 



PREDICTIVE 
FIELD CONTENT 



PREDICTION 



0 0 


STRONG - NOT TAKEN 


0 1 


WEAK - NOT TAKEN 


1 0 


WEAK - TAKEN 


1 1 


STRONG - TAKEN 



92 



J 



Fig. 4 



10 



EP 0 60S 876 A1 




100 



use FETCH 
ADDRESS BITS 

TO ACCESS 
BRANCH 
HISTORY TABLE 



I 



^104 



READ GROUP OF 
PREDICTIVE 
FIELDS 



I 



j-106 



SELECT 

PREDICTIVE FIELD 
CORRESPONDING 
TO INSTRUCTION 

POSITION IN QUEUE 



j-t08 



Fig, 5 




INCREMENT 
PREDICTIVE 
FIELD VALUE AND 
WRITE TO BRANCH 
HISTORY TABLE 



DECREMENT 
PREDICTIVE 
FIELD VALUE AND 
WRITE TO BRANCH 
HISTORY TABLE 



^116 



c 



RETURN 



y 



118 



11 



Curopeftn Patent 
Office 



EUROPEAN SEARCH REPORT 



AppUcatfoo Number 

EP 93 12 0944 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Cateffory 



Ckatira of doctunent with indiGatioiu where appropriate, 
of rctevant pa 



RdevMit 
to 



CLASSinCATION OF THE 
APPUCATION (IiitCUS) 



EP-A-0 586 057 (SUM MICROSYSTEMS INC.) 9 
March 1994 

* the whole document * 

EP-A-0 401 992 (ADVANCED MICRO DEVICES 
INC) 

* column 3, lines 10-52; column 4, line 52 
- column 5, line 16; column 5, line 45 - 
column 6» line 38 * 

13TH ANNUAL SYMPOSUIM ON COMPUTER 
ARCHITECTURE 2 June 1986 , TOKYO, JP 
pages 396 - 403 

MCFARLING AND HENNESSY ■ Reducing the cost 
of branches^ 

* page 398, right column, line 40 - page 
399, left column, line 13, figure 1 * 

COMPUTER ARCHITECTURE NEWS 
vol, 19, no. 2 . April 1991 , NEW YORK US 
pages 176 - 188 

D. WALL 'Limits of instruction level 
parallelism' 

* page 177, right column, lines 27-30 * 



1.6 



1.6 



G06F9/38 



2-4.7-lC 



2-4,7-lC 



report ham been Arawn up for all da 



TECHNICAL FUXDS 
SEAKOCED OnLCLS) 



G06F 



S 
I 

s 



THE HAGUE 



21 April 1994 



Weinberg, L 



CATEGORY OF CITED DOCUMENTS 

X : partlculariy ralmnt If taSuat Klotw 

Y : pvtlaUarty rtlwent if coabfaiori wttk uwtbw 

4 o fii iiMnt of tbo saa* cslegpiy 
A : f c h aotoglcil faadvouoi 
O : ao a -w iUU B ttsdosuM 
P : iattmoiiiti 4ociiiB«at 



T : tbmy or prtndpl* iiiii«rtyfns tb« iBventioa 
E : «oril«r patmt riocuiHiit, but piibHsbc4 00, or 

aitm tbo filing 4ett 
D : docoMit dt«4 to tli« afpltesttoa 
L : rfoamoat dtotf for ptlMr nosoos 

A : mamUi d tbo saao vatatA fmV^^wr^^ 



